Taxonomic Identity Resolution of Highly Phylogenetically Related Strains and Selection of Phylogenetic Markers by Using Genome-Scale Methods: The Bacillus pumilus Group Case
نویسندگان
چکیده
Bacillus pumilus group strains have been studied due their agronomic, biotechnological or pharmaceutical potential. Classifying strains of this taxonomic group at species level is a challenging procedure since it is composed of seven species that share among them over 99.5% of 16S rRNA gene identity. In this study, first, a whole-genome in silico approach was used to accurately demarcate B. pumilus group strains, as a case of highly phylogenetically related taxa, at the species level. In order to achieve that and consequently to validate or correct taxonomic identities of genomes in public databases, an average nucleotide identity correlation, a core-based phylogenomic and a gene function repertory analyses were performed. Eventually, more than 50% such genomes were found to be misclassified. Hierarchical clustering of gene functional repertoires was also used to infer ecotypes among B. pumilus group species. Furthermore, for the first time the machine-learning algorithm Random Forest was used to rank genes in order of their importance for species classification. We found that ybbP, a gene involved in the synthesis of cyclic di-AMP, was the most important gene for accurately predicting species identity among B. pumilus group strains. Finally, principal component analysis was used to classify strains based on the distances between their ybbP genes. The methodologies described could be utilized more broadly to identify other highly phylogenetically related species in metagenomic or epidemiological assessments.
منابع مشابه
Phylogenetic Diversity of the Bacillus pumilus Group and the Marine Ecotype Revealed by Multilocus Sequence Analysis
Bacteria closely related to Bacillus pumilus cannot be distinguished from such other species as B. safensis, B. stratosphericus, B. altitudinis and B. aerophilus simply by 16S rRNA gene sequence. In this report, 76 marine strains were subjected to phylogenetic analysis based on 7 housekeeping genes to understand the phylogeny and biogeography in comparison with other origins. A phylogenetic tre...
متن کاملCharacterization and Phylogenetic Analysis of Magnaporthe spp. strains on Various Hosts in Iran
Background: Populations of Magnaporthe, the causal agent of rice blast disease, are pathotypically and genetically diverse and therefore their interaction with different rice cultivars and also antagonistic microorganisms are very complicated. Objectives: The objectives of the present study were to characterize phylogenetic relationships of 114 native Magnaporthe strains, isolated from rice a...
متن کاملGenome-Scale Metabolic Network Models of Bacillus Species Suggest that Model Improvement is Necessary for Biotechnological Applications
Background: A genome-scale metabolic network model (GEM) is a mathematical representation of an organism’s metabolism. Today, GEMs are popular tools for computationally simulating the biotechnological processes and for predicting biochemical properties of (engineered) strains.Objectives: In the present study, we have evaluated the predictive power of two ...
متن کاملGenomic insights into the taxonomic status of the Bacillus cereus group
The identification and phylogenetic relationships of bacteria within the Bacillus cereus group are controversial. This study aimed at determining the taxonomic affiliations of these strains using the whole-genome sequence-based Genome BLAST Distance Phylogeny (GBDP) approach. The GBDP analysis clearly separated 224 strains into 30 clusters, representing eleven known, partially merged species an...
متن کاملImproving Phylogeny Reconstruction at the Strain Level Using Peptidome Datasets
Typical bacterial strain differentiation methods are often challenged by high genetic similarity between strains. To address this problem, we introduce a novel in silico peptide fingerprinting method based on conventional wet-lab protocols that enables the identification of potential strain-specific peptides. These can be further investigated using in vitro approaches, laying a foundation for t...
متن کامل